AITopics | improved uncertainty and adversarial robustness

Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness

Neural Information Processing SystemsDec-25-2025, 15:16:15 GMT

Ensemble approaches for uncertainty estimation have recently been applied to the tasks of misclassification detection, out-of-distribution input detection and adversarial attack detection. Prior Networks have been proposed as an approach to efficiently emulate an ensemble of models for classification by parameterising a Dirichlet prior distribution over output distributions. These models have been shown to outperform alternative ensemble approaches, such as Monte-Carlo Dropout, on the task of out-of-distribution input detection. However, scaling Prior Networks to complex datasets with many classes is difficult using the training criteria originally proposed. This paper makes two contributions. First, we show that the appropriate training criterion for Prior Networks is the reverse KL-divergence between Dirichlet distributions. This addresses issues in the nature of the training data target distributions, enabling prior networks to be successfully trained on classification tasks with arbitrarily many classes, as well as improving out-of-distribution detection performance. Second, taking advantage of this new training criterion, this paper investigates using Prior Networks to detect adversarial attacks and proposes a generalized form of adversarial training. It is shown that the construction of successful adaptive whitebox attacks, which affect the prediction and evade detection, against Prior Networks trained on CIFAR-10 and CIFAR-100 using the proposed approach requires a greater amount of computational effort than against networks defended using standard adversarial training or MC-dropout.

improved uncertainty and adversarial robustness, kl-divergence training, name change, (5 more...)

Neural Information Processing Systems

Industry:

Information Technology > Security & Privacy (0.82)
Government > Military (0.82)

Technology:

Information Technology > Security & Privacy (0.82)
Information Technology > Artificial Intelligence > Machine Learning (0.76)

Add feedback

Reviews: Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness

Neural Information Processing SystemsFeb-12-2025, 02:33:41 GMT

Detecting inputs that are outside the distribution of training examples, including adversarial inputs, is an important problem; reviewers and the area chair agree that this paper makes a useful algorithmic contribution towards solving this problem. The argument that reverse KL is conceptually correct, while forward KL as used previously is conceptually wrong, is significant. Training with reverse KL is a simple and compelling idea that practitioners can try easily. For these reasons the paper is being accepted so that the community can benefit from it quickly, despite the fact that reviewers have identified ways in which the writing of the paper, and the empirical evaluation, need improvement. The authors are encouraged to improve the final version.

improved uncertainty and adversarial robustness, kl-divergence training, reverse kl

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.71)

Add feedback

Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness

Neural Information Processing SystemsJan-25-2025, 01:12:02 GMT

Ensemble approaches for uncertainty estimation have recently been applied to the tasks of misclassification detection, out-of-distribution input detection and adversarial attack detection. Prior Networks have been proposed as an approach to efficiently emulate an ensemble of models for classification by parameterising a Dirichlet prior distribution over output distributions. These models have been shown to outperform alternative ensemble approaches, such as Monte-Carlo Dropout, on the task of out-of-distribution input detection. However, scaling Prior Networks to complex datasets with many classes is difficult using the training criteria originally proposed. This paper makes two contributions.

detection, improved uncertainty and adversarial robustness, kl-divergence training, (2 more...)

Neural Information Processing Systems

Industry:

Information Technology > Security & Privacy (0.66)
Government > Military (0.66)

Technology:

Information Technology > Security & Privacy (0.66)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness

Malinin, Andrey, Gales, Mark

Neural Information Processing SystemsMar-19-2020, 02:33:22 GMT

Ensemble approaches for uncertainty estimation have recently been applied to the tasks of misclassification detection, out-of-distribution input detection and adversarial attack detection. Prior Networks have been proposed as an approach to efficiently emulate an ensemble of models for classification by parameterising a Dirichlet prior distribution over output distributions. These models have been shown to outperform alternative ensemble approaches, such as Monte-Carlo Dropout, on the task of out-of-distribution input detection. However, scaling Prior Networks to complex datasets with many classes is difficult using the training criteria originally proposed. This paper makes two contributions.

detection, improved uncertainty and adversarial robustness, kl-divergence training, (2 more...)

Neural Information Processing Systems

Industry:

Information Technology > Security & Privacy (0.66)
Government > Military (0.66)

Technology:

Information Technology > Security & Privacy (0.66)
Information Technology > Artificial Intelligence > Machine Learning (0.43)

Add feedback

Filters

Collaborating Authors

improved uncertainty and adversarial robustness

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness

Reviews: Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness

Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness

Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness